Quantile approximation for robust statistical estimation

نویسندگان

  • David M. Mount
  • Nathan S. Netanyahu
  • Christine D. Piatko
  • Ruth Silverman
  • Angela Y. Wu
چکیده

Given a set P of n points in R d , a fundamental problem in computational geometry is concerned with nding the smallest enclosing \range" of P. Well known instances of this problem include nding, e.g., the smallest enclosing box 10], smallest enclosing simplex 11], minimum volume ball 17, 20], (2-D) smallest enclosing ellipsoid 16, 20, 2, 14], and minimum volume annulus 1, 8, 5]. In this paper, we consider the following generic variant: Given a set of n points in R d , nd the smallest range in question that contains (at least) a certain quantile (up to 50%) of the data. Although this variant has been studied to some extent in computational geometry | instances include enclosing k points by a circle 7, 4] and nding the smallest axis parallel rectangle enclosing k points 15] | the algorithms proposed deal mainly with speciic cases and their running times are relatively high (O(n 2 log n) and O(n 3), respectively, for the minimum enclosing circle and axis parallel rectangle in the plane). stems from the growing need for eecient data analysis techniques that are robust to outlying/noisy observations. While such data can be handled successfully due to robust statistical estimators, the eecient computation of these estimators continues to pose a formidable algorithmic challenge. The basic measure of the robustness of an estima-tor is its breakdown point, that is, the fraction (up to 50%) of outlying data that can corrupt the estimator (see, e.g., 13] for an exact deenition). For example , Rousseeuw's least median-of-squares (LMS) regression estimator 12] (which is equivalent to nding the narrowest hyperstrip contanining at least 50% of the data) is among the best known 50% breakdown-point estimators. The best exact algorithm known for this problem runs in O(n d+2 log n) time 19]. Because of this high running time, a simple O(n logn) Monte Carlo algorithm (for xed d) is commonly used 13]. However, the latter provides no guarantee of accuracy (even probabilistic) unless the data set satisses certain assumptions. 1 Other robust estimators considered in this paper are the minimum volume disk (e.g., the minimum volume ellipsoid (MVE)) and the minimum volume annulus (MVA) enclosing (at least) half of the points in P. As in the case of LMS, algorithms known for these estimators have either high complexity or they provide no guarantee of accuracy due to their Monte Carlo nature 13]. In an …

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Robust Distributed Estimation Algorithm under Alpha-Stable Noise Condition

Robust adaptive estimation of unknown parameter has been an important issue in recent years for reliable operation in the distributed networks. The conventional adaptive estimation algorithms that rely on mean square error (MSE) criterion exhibit good performance in the presence of Gaussian noise, but their performance drastically decreases under impulsive noise. In this paper, we propose a rob...

متن کامل

Robust Estimation of Conditional Mean by the Linear Combination of Quantile Regressions

In this paper, we propose a new robust estimator for regression problems in the form of the linear combination of quantile regressions. The proposed robust regression estimator is helpful for the conditional mean estimation especially when the error distribution is asymmetric or/and heteroscedastic, where conventional robust regressions yield considerable bias to the conditional mean. First we ...

متن کامل

Quantile Approximation for Robust Statistical Estimation and k-Enclosing Problems

Given a set P of n points in Rd, a fundamental problem in computational geometry is concerned with finding the smallest shape of some type that encloses all the points of P . Well-known instances of this problem include finding the smallest enclosing box, minimum volume ball, and minimum volume annulus. In this paper we consider the following variant: Given a set of n points in Rd, find the sma...

متن کامل

Instrumental Variables Estimation and Weak-Identification-Robust Inference Based on a Conditional Quantile Restriction

Extending the L1-IV approach proposed by Sakata (1997, 2007), we develop a new method, named the ρτ -IV estimation, to estimate structural equations based on the conditional quantile restriction imposed on the error terms. We study the asymptotic behavior of the proposed estimator and show how to make statistical inferences on the regression parameters. Given practical importance of weak identi...

متن کامل

Robust quantile estimation and prediction for spatial processes

In this paper, we present a statistical framework for modeling conditional quantiles of spatial processes assumed to be strongly mixing in space. We establish the L1 consistency and the asymptotic normality of the kernel conditional quantile estimator in the case of random fields. We also define a nonparametric spatial predictor and illustrate the methodology used with some simulations.

متن کامل

On the Robust Estimation of Power Spectra, Coherences, and Transfer Functions

Robust estimation of power spectra, coherences, and transfer functions is investigated in the context of geophysical data processing. The methods described are frequency-domain extensions of current techniques from the statistical iterature and are applicable in cases where section-averaging methods would be used with data that are contaminated by local nonstationarity or isolated outliers. The...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998